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IN THE CLAIMS: 

The text of all pending claims, (including withdrawn claims) is set forth below. Cancelled 
and not entered claims are indicated with claim number and status only. The claims as listed 
below show added text with underlining and deleted text with otr i k e through . The status of each 
claim is indicated with one of (original), (currently amended), (cancelled), (withdrawn), (new), 
(previously presented), or (not entered). 

Please CANCEL claims 6-9 and AMEND claims 10 and 1 1 in accordance with the 
following: 

1 . (previously presented) A method for collecting documents linked to each other from a 
network by crawling the network, comprising: 

collecting documents equal to or larger, in number, than a predetermined value from 
inside a community through the network based on a reference of the document; and 

collecting documents from inside and outside the community based on the reference of 
collected documents after collecting the documents equal to or larger in number than the 
predetermined value from inside the community. 

2. (original) The method according to claim 1, further comprising: 

computing a significance level indicating a level of significance of the collected document 
according to the reference of the collected document, and information about a position of the 
collected document in the network; and 

determining a document to be collected based on the reference and the significance 

level. 

3. (original) The method according to claim 2, wherein said document to be collected is 
determined separately for inside the community and for outside the community. 

4. (original) The method according to claim 3, further comprising: 
presenting a result of retrieving the collected documents separately for inside the 

community and outside the community. 

5. (original) The method according to claim 2, further comprising: 

determining whether or not the document is in the community according to information 
indicating the position of the document in the network. 



2 



Serial No. 09/880,070 



6. (cancelled) 

7. (currently amended) Tbe-A method accord i ng to c l aim 6 for collecting documents 
linked to each other from a network by crawling the network , furtbef-comprising: 

providing a positive sample document group which is a document group relating to a 
field, and a negative sample document group which is a document group less related to the field; 

determining a document which is to be collected and is related to the field based on a 
reference to the positive sample document group and the negative sample document group by 
computing a reference score indicating a level at which a document is referenced only by a 
document in the positive sample document group based on the reference; and 



8. (currently amended) The-A method accord i ng to c l a i m 6 for collecting documents 
linked to each other from a network by crawling the network , wh e r e in comprising: 

providing a positive sample document group which is a document group relating to a 
field, and a negative sample document group which is a document group less related to the field; 

determining a document which is to be collected and is related to the field based on a 
reference to the positive sample document group and the negative sample document group by 
computing a co-reference score indicating a level at which a document is referenced together 
with a document in the positive sample document group for a document referenced by a 
collected document referring to a document in the positive sample document group based on the 
reference; and 



be collected. 

9. (cancelled) 

10. (currently amended) The method according to claim 1 t further comprising: 
summarizing said-the collected docum e nt group documents based on a referencing 

expression used in the collected docum e nt group documents . 




collecting a document having a high reference score as the document to be 



collected. 




- collecting a document having a high co-reference score as the document to 
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11. (currently amended) The method according to claim 1, further comprising: 
assigning a keyword to the collected docum e nt documents based on a referencing 

expression used in the collected docum e n t documents . 

12. (original) The method according to claim 1, further comprising: 

not assigning a keyword based on the referring expression when the referencing 
expression is used regardless of a content of a referenced document. 

13. (original) The method according to claim 11, further comprising: 

counting a number of different documents referenced using the referencing expression; 

and 

not assigning the keyword based on the referencing expression when the number of 
different documents is equal to or larger than a predetermined value. 

14. (original) The method according to claim 11, further comprising: 

counting a reference frequency at which each collected document is referenced by the 
referencing expression when the number of different documents is smaller than a predetermined 
value; and 

determining whether or not the referencing expression is assigned as the keyword based 
on the number of different documents and the reference frequency. 

15. (original) The method according to claim 11 , further comprising: 

combining the keyword based on the referencing expression with a keyword extracted 
from text of the collected document, and a keyword extracted from information indicating a 
position in the network about the collected document. 

16. (previously presented) A method for retrieving documents linked to each other from 
a terminal belonging to a community in a network by crawling the network, comprising: 

transmitting information for retrieval of the documents to a server; and 

receiving the documents retrieved separately from inside and outside the community 

according to the information for retrieval together with information indicating a significance level 

for the community. 
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17. (previously presented) A document collection apparatus collecting documents linked 
to each other from a network by crawling the network, comprising: 

a next prospect determination unit determining a prospect to be collected next based on 
a reference of a collected document; 

a community determination unit determining whether or not the prospect is in a 
community in the network according to information indicating a position in the network of the 
prospect; and 

a document collection unit collecting the prospect from the network, wherein said 
document collection unit collects the prospect from inside and outside the community after 
collecting documents larger in number than a predetermined value from inside the community. 

18. (previously presented) A document collection apparatus collecting documents linked 
to each other from a network by crawling the network, comprising: 

a next prospect determination unit determining a prospect to be collected next based on 
a reference between a positive sample document group which is a document group related to a 
field and a negative sample document group which is a document group less related to the field; 
and 

a document collection unit collecting the prospect from the network. 

19. (previously presented) A computer-readable recording medium recording a program 
used to direct a computer to control collection of documents linked to each other from a network 
by crawling the network, comprising: 

collecting documents equal to or larger, in number, than a predetermined value from a 
community through the network based on a reference of the document; and 

collecting documents from inside and outside the community based on the reference of 
collected documents after collecting the documents equal to or larger, in number, than the 
predetermined value from inside the community. 

20. (previously presented) A computer-readable recording medium recording a program 
used to direct a computer to control collection of documents linked to each other from a network 
by crawling the network, comprising: 

providing a positive sample document group which is a document group relating to a 
field, and a negative sample document group which is a document group less related to the field; 
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determining a document to be collected relating to the field based on a reference to the 
positive sample document group and the negative sample document group; and 
collecting the document to be collected from the network. 

21 . (previously presented) A computer data signal embodied on a carrier expressing a 
program used to direct a computer to control collection of documents linked to each other from a 
network by crawling the network, said program instructing the computer to perform the process 
comprising: 

collecting documents equal to or larger than, in number, a predetermined value from 
inside a community in the network based on a reference of the documents; and 

collecting documents from inside and outside the community based on the reference of 
collected documents after collecting documents equal to or larger, in number, than the 
predetermined value from the community. 
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